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USE OF INFRARED SPECTROSCOPY IN GENOTYPIC ANALYSIS 

CROSS REFERENCE TO RELATED APPLICATIONS 
[0001] This application claims the benefit of U.S. Provisional Application 
Serial No. 60/269,474, filed February 16, 2001, the disclosure of which is herein 
incorporated by reference. 

FIELD OF THE INVENTION 
[0002] The present invention relates to methods and apparatuses for 
comparing genotypes of biological tissue samples. Specifically, infrared spectroscopy 
is used for comparing genotypes of soybean tissue samples, and more particularly for 
determining resistance to soybean cyst nematode. 

BACKGROUND OF THE INVENTION 
[0003] Chemical bonds in organic molecules absorb energy in the near 
infrared region of the spectrum. Near infrared reflectance spectroscopy involves 
measuring the amount of light reflected by the substance to determine the amount of 
light being absorbed by the substance. Different types of carbon bonds absorb energy 
at different wavelengths. Determining the amount of light absorbed at a certain 
wavelength provides insight into which functional groups are in the substance and a 
quantitative measure of compounds containing these functional groups can be 
determined. 

[0004] Near infrared spectroscopy is also employed in chemical imaging, and 
uses a tunable light source external to the experimental subject to determine its 
chemical composition. Typically, measurements can be made quickly on all types of 
samples. 

[0005] Infrared thermometers have been used in agriculture for measuring 
temperatures and determining the need for the irrigation of crops. See, for example, 
U.S. Patent Nos. 4,301,682 and 4,998,826. 




[0006] Soybean {Glycine max (L.) Merr) is an important agricultural crop 
grown worldwide. Soybean comprises 52 percent of world oilseed production with 
155.1 million metric tons, 46 percent of which were produced in the United States 
(http://wwwMnitedsoybean.org/soystats2000/). In 1999, the United States produced 
71.9 million metric tons on 29.9 million hectares. The importance of soybean 
continues to grow as new uses for soybean products are constantly being developed. 
Soybean has also long been a staple food in many diets. Uses of soybean range from 
industrial uses such as inks and lubricants to foods and food additives. 

[0007] As the number of hectares of soybeans grown increases, so does the 
pressure exerted on soybean by the pathogen, Soybean Cyst Nematode (SCN) 
(Heterodera glycines Inchinohe). Currently, soybean growers practice crop rotation 
utilizing non-host species to reduce reproduction and populations of SCN. Genetic 
resistance to SCN has also been identified in soybean and implemented in soybean 
breeding programs for the development of SCN resistant soybean cultivars. While 
crop rotation is an effective measure in combating SCN, the use of SCN resistant 
soybean cultivars allows growers to increase soybean acreage without sacrificing 
. yield to the SCN pathogen. 

[0008] Breeding soybean for resistance to SCN involves the use of genetically 
resistant cultivars whose source of resistance is derived from plant introductions (Pis). 
Many of these Pis exhibit poor phenotypes. Crosses of these PI lines with 
agronomically desirable, but SCN susceptible phenotypes, result in populations of 
mixed resistant and susceptible genotypes. 

[0009] Soybean cyst nematode was first characterized in Asia, but not 
discovered in the United States until 1954 (Winstead et al. 1955). Since its discovery 
in North Carolina, SCN has spread throughout all soybean-producing regions in the 
United States. Yield losses due to SCN in the United States were estimated at 7.6 
million metric tons in 1998 (Wrather et al., 2000). This represented an increase of 1.6 
million metric tons over 1998 estimates. Soybean cyst nematode is the most 
damaging pest of soybean today. It is a soil borne pest, which makes it able to spread 
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with the movement of contaminated soil to uncontaminated soil via soil erosion or 
farm implements (Riggs, 1977). 

[0010] Soybean cyst nematodes are prolific reproducers with each female 
being able to produce between 200 and 500 eggs. A cyst is formed by the SCN 
female to protect the eggs from the environment (Endo, 1964). Once the cyst is 
broken the eggs are released and they immediately begin to hatch if environmental 
conditions are conducive to nematode survival. The cysts are easily recognizable on 
the roots of host plants. Hosts of SCN include soybean, annual lespedeza 
(Kummerowia striate (Thunb.) H. & A.), common vetch (Vicia saliva L.), adzuki bean 
(Vigna angularis (Willd.) Ohwi & Ohashi), white lupine (Lupinus albus L.), and 
cowpeas {Vigna savi) (Epps and Chambers, 1962). 

[0011] Plants are continually attacked by a diverse range of phytopathogenic 
organisms. These organisms cause substantial losses to all crops each year. 
Traditional approaches for control of plant diseases have been the use of chemical 
treatment and the construction of interspecific hybrids between resistant crops and 
their wild-type relatives as sources of resistant germplasm. However, environmental 
and economic concerns make chemical pesticides undesirable, while the traditional 
interspecific breeding is inefficient and often cannot eliminate the undesired traits of 
the wild species. Thus, the discovery of pest and pathogen-resistant genes provides a 
new approach to control plant disease. 

[0012] Nematode infection is prevalent in many crops. Nematicides such as 
Aldicarb® and its breakdown products are known to be highly toxic to mammals. As 
a result, government restrictions have been imposed on the use of these chemicals. 

[0013] Several genes responsible for disease resistance have been identified 
and isolated from plants. See Staskawicz et al. (1995) Science 268:661-667. Recently, 
the sugar beet Hsl.sup.pro-1 gene that confers resistance to the beet cyst nematode has 
been cloned. See Cai et al. (1997) Science 275:832-834; and Moffat (1997) Science 
275:757. Transformation of plants or plant tissues with the resistance genes can 
confer disease resistance to susceptible strains. See, for example, PCT Publication 
WO 93/19181; and Cai et al. (1997) Science 275:832-834. 
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[0014] Near infrared spectroscopy (NIRS; 1000 to 3000 nm) has been used 
extensively for measuring moisture, protein, starch, and oil contents of seeds of 
several crop species (Osborne and Fearn, 1986). Plant breeders have successfully 
used NIRS to select individuals with superior seed or forage quality. However, 
selections based on genotypic markers have not been extensively tested. 
Traditionally, calibration equations are developed by correlating spectral data 
generated by NIRS, with reference data generated from substance analysis using wet 
chemistry. Calibration equations developed for the purpose of distinguishing 
genotypes must be based on reference values, which accurately characterize genotypic 
differences in the calibration population. 

[0015] Rutherford (1998) examined the use of NIRS to determine resistance 
of sugarcane (Saccharum spp. hybrids) to stalk borer (Eldana saccharina Walker). 
Budscale extracts were used for NIRS analysis as well as analysis by high 
performance liquid chromatography (HPLC). Reference values were based on field 
bioassay measurements and resistance was categorized on a scale of one to nine. 
Limited success was obtained with the study in distinguishing resistance to stalk borer 
from susceptibility by using equations developed from modified partial least squares 
regression. An attempt to develop an equation based on one or few peaks using a 
forward stepwise multiple linear regression proved unsuccessful suggesting pathogen 
resistance is biochemically complex. It is also worth noting that in this study, 
prediction of stalk borer resistance in sugarcane based on NIRS was more accurate 
than prediction based on HPLC. 

[0016] A study by Delwiche et al. (1999) also suggested the ability of NIRS to 
make genotypic distinctions. This study looked at the ability of NIRS to distinguish 
wheat (Triticum aestivum L.) lines containing wheat-rye (Secale cereale L.) 
translocations from lines that did not contain the translocation using ground seed. A 
discriminate analysis was performed on the spectral data to classify samples as either 
having the translocation or not having the translocation. Classification accuracy 
ranged from 78 to 99 percent. However, difficulties arose in correctly classifying 
near-isogenic lines differing only by the translocation and lines that were 
heterogeneous for the translocation. 
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[0017] Genetic resistance to SCN within soybean germplasm has emerged as 
the forerunner to overcoming this pathogen. While soybean breeders have had much 
success in developing SCN resistant germplasm, current methods for selecting this 
germplasm are labor and resource intensive. A challenge exists for identifying a 
method of screening germplasm for SCN resistance that does not significantly tax the 
resources of the modern soybean breeding program yet remains accurate in selecting 
desirable lines. 

[0018] Current methodology for screening soybean {Glycine max (L.) Merr) 
genotypes for resistance to soybean cyst nematode (Heterodera gylcines Ichinohe) 
(SCN) involve the use of a labor and resource intensive bioassay that can provide 
inconsistent results due to heterogeneous populations of SCN. Thus, the development 
of near infrared spectroscopy (MRS) as a monitoring and/or comparing system for 
screening soybean populations for SCN resistance would provide an improvement in 
the efficiency of the breeding process by saving time and money over current 
bioassay methods. 

SUMMARY OF THE INVENTION 
[0019] The present invention relates to methods and apparatuses for 
comparing genotypes of tissue samples. 

[0020] The invention further relates to the use of infrared spectroscopy for 
comparing genotypes of tissue samples. 

[0021] The invention further relates to the use of infrared spectroscopy for 
comparing genotypes of plant tissue samples. 

[0022] The invention additionally relates to the use of infrared spectroscopy 
for comparing genotypes of soybean plant tissue samples. 

[0023] The invention still further relates to the use of infrared spectroscopy 
for determining resistance to soybean cyst nematode. 
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[0024] The invention yet further relates to plant breeding programs, plants and 
plant derivatives, and seeds selected and/or generated from using these methods and 
apparatuses. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0025] Figure 1 illustrates the entire spectrum of 77 SCN resistant and 
susceptible samples with correlation between FI and spectra. 

[0026] Figure 2 illustrates a close-up of spectra showing variation for SCN 
resistance and susceptibility plotted against correlation for FI. 

[0027] Figure 3 illustrates the entire spectrum of SCN resistant and 
susceptible samples averaged by groups of 11 with correlation of FI for each 
wavelength. 

[0028] Figure 4 illustrates a close-up of spectra showing separation between 
SCN resistance and susceptibility. 

[0029] Figure 5 illustrates another close-up of spectra showing separation 
between SCN and susceptibility. 

[0030] Figure 6 illustrates the correlation between spectra and inoculation 
treatment in leaf tissue harvested after 60 days. 

[0031] Figure 7 illustrates the correlation between spectra and SCN resistance 
in leaf tissue harvested after 60 days. 

[0032] Figure 8 illustrates the correlation between spectra and inoculation 
treatment in root tissue harvested after 60 days. 

[0033] Figure 9 illustrates the correlation between spectra and SCN resistance 
in root tissue harvested after 60 days. 

[0034] Figure 10 illustrates the averaged spectra across 3 reps for 60 days root 
tissue with correlation between FI and spectra. 
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[0035] Figure 11 illustrates the averaged spectra across 3 reps for 60 day leaf 
tissue with correlation between FI and spectra. 

[0036] Figure 12 illustrates the averaged spectra across 3 reps for 60 day root 
tissue; close-up of separation between SCN resistant and susceptible samples. 

DETAILED DESCRIPTION OF THE INVENTION 
[0037] Infrared spectroscopy may be employed in the comparison and 
analysis of a number of genotypic traits for a variety of tissue samples. Different 
genotypes in any given genetic locus are reflected in the ultimate chemical 
composition of a given biological sample. Biological samples may be any suitable 
type of plant, animal, bacterial, fungal, or viral sample. 

[0038] Examples of genotypes which may be analyzed in this manner are 
genotypes which may be comparatively discerned using infrared spectroscopy, which 
may be general or specific traits, such as, for example, disease resistance, disease 
susceptibility, enhanced nutritional composition, increased fiber production, increased 
membrane transport capabilites, increased amino acid content, and the like. Preferred 
for the practice of the present invention is disease resistance in plants, and preferably 
the resistance of soybean plants to soybean cyst nematode. 

[0039] Currently, MRS is used in agricultural programs to analyze protein 
and fatty acids. The fundamental principles of NIRS could be used in selection of 
lines for a number of traits including disease resistance, quality, and physiological 
traits. This principle can be applied to SCN resistance screening by considering the 
host-pathogen interaction. Resistant cultivars do not allow reproduction of SCN on 
their roots. Resistance genes in soybean code for the production of a plant defense 
response. Several different responses have been detected in soybean. Some of these 
include the accumulation of peroxidases, polyphenol oxidases, superoxide dismutase, 
proteinase inhibitors, chitinases, and phytoalexins (Qiu et al., 1997). These responses 
could be detected through NIRS in resistant cultivars and not in susceptible ones. 
Roberts et al. (1987) used NIRS to quantify the mold content of hay by analyzing the 
chitin content in the moldy hay. Nillson et al. (I 994) used this same principle to 
detect anther smut disease (Microbotryum violacem) in Silene dioca. 
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[0040] NEPS has been successfully used to distinguish between cultivars of 
sugarcane (Saccharwn spp. hybrids) that were resistant and susceptible to Stalk Borer 
(Eldana saccharina Walker) (Rutherford, 1998). Wavelengths corresponding to 
chlorogenic acid were identified in sugarcane as having different MR absorptions for 
resistant and susceptible cultivars. This study used budscale extracts for analysis and 
regressed spectral data against bioassay data to define equations that could be used to 
predict sugarcane samples with unknown resistance. Delwiche et al. (1999) used 
MRS to identify wheat (Triticum aestivum L.) cultivars that contained either the 
1AL.1RS or 1BL.1RS wheat-rye (Secale cereale L.) translocation using ground wheat 
seeds. A discriminate analysis using MRS was used to make the distinction between 
cultivars with the translocation from cultivars without. 

[0041] Near infrared spectroscopy is an established method for analyzing 
plant tissue. However, little data exists on the ability of MRS to function as a 
selection tool for genetic pathogen resistance. Near infrared spectroscopy is not a 
new technology as it has been used extensively in the last half of this century for 
measuring moisture, protein, starch, and oil contents of seeds of several crop species 
. (Osborne and Fearn, 1986). More recently, MRS has been useful in measuring the 
quality of forages (Roberts et al., 1987) and also in quality control in the food 
processing industry (Bewig, 1992). Importance of MRS in determining organic 
constituents is that it is much faster, more repeatable, and easier than traditional 
methods that involve wet chemistry. 

[0042] The principle behind MRS is that many of the chemical bonds in 
organic molecules will absorb energy in the near infrared region of the light spectrum 
(Osborne and Feam, 1986). By measuring the amount of energy transmitted or 
reflected by the functional group, one can determine the amount of energy absorbed 
by the substance. This can be correlated to the type of bonds that exist in the 
substance and the different constituents of the substance and their concentrations can 
be determined. 

[0043] While the use of plant tissue samples is feasible in a plant breeding 
program, the use of mature seeds as a non-destructive analytical means is preferred. 
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The instant invention examined the ability of NIRS to distinguish SCN resistance 
from susceptibility using ground soybean seed. 

[0044] Samples are analyzed using NIRS, and the resulting spectra and data 
are compared to appropriate controls. Calibration equations were developed using 
different types of reference values for classifying SCN resistance. Spectral data is 
visually analyzed and peaks are identified. For example, SCN resistant lines showed 
a different spectral trend than susceptible lines in the resulting spectra. 

[0045] The data derived from the spectral analysis is then further analyzed 
using appropriate computer programs, for example. 

[0046] The following Examples further exemplify the invention. 

EXAMPLES 
Example 1 

Preparation of Plant Materials 
Plant Materials: 

[0047] Two-hundred fifteen experimental soybean cultivars were selected 
from the University of Missouri's soybean breeding program. These cultivars were 
selected in the F 4:5 &6 generations from preliminary yield trials grown in 1999. The 
cultivars were divided into three groups based on their pedigrees. Seventy-eight 
experimental cultivars resulted from crosses between two SCN resistant parents and 
were grouped together. Eighty-two experimental cultivars resulted from crosses 
between one SCN resistant and one SCN susceptible parent and were grouped 
together. The final 55 experimental cultivars resulted from crosses between two SCN 
susceptible parents and were grouped together. Seed quality of these cultivars was 
generally poor as many seeds were infested with purple seed stain (Cercospora 
kikuchii) as well as phomopsis seed decay (Phomopsis longicolla Hobbs). Five 
hundred seeds were visually selected from each cultivar to ensure that the best quality 
seed was analyzed. 

SCN Bioassay: 
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[0048] The SCN bioassay was performed in the greenhouse using established 
methods (Rao Arelli et al., 1991 and Arelli et al., 1999). A SCN Race 3 isolate was 
used for inoculation of the experimental lines. Twenty-five seeds from each line were 
treated with Captan (N-trichloromethylthio-4-cyclohexene-l,2-dicarboximide) to aid 
in germination and germinated in germination bags. Five seedlings were transplanted 
and inoculated with the SCN Race 3 isolate. Race determination was confirmed by 
reaction to the differential cultivars and to the susceptible cultivar 'Hutcheson' (Buss 
et al., 1988). The female index (FT) (formerly called index of parasitism, Golden et al. 
1970 and Schmitt and Shannon 1992) was used to evaluate the response of each 
cultivar to the SCN Race 3 isolate. Female index was calculated by dividing the 
numbers of cysts counted on an experimental line by the number of cysts counted on 
the SCN susceptible cultivar 'Essex' and multiplying by 1 00. Cultivars were then 
classified by their Fl as follows: Fl = 0-9, resistant; FI = 10-29, moderately resistant; 
Fl = 30-59, moderately susceptible; Fl = 60+, susceptible. 

[0049] The bioassay results showed a wide range of infection levels for lines 
with the same SCN resistance genotype. Because SCN is a genetically dynamic 
pathogen, gene mapping molecular marker studies would assist in providing 
genotypic information from breeding lines. The ability to use MRS to accurately 
predict SCN reaction in breeding lines rests on the ability to calibrate with static 
reference values that are characteristic of SCN resistance genotypes. 

NIR Spectroscopic Analysis: 

[0050] Approximately 20g of seed from each cultivar were ground using a 
Wiley benchtop grinder to pass through a 1-mm screen during the winter of 2000. 
Ground seed samples were stored in sealed plastic bags in a freezer at -20°C. 
Approximately 7g of ground seed from each cultivar were scanned with a Foss 
NIRSystems Model 6500 (Foss NIRSystems Silver Spring, MD). The instrument 
scanned 7g from each sample with radiation from 400 to 2498 nm, and log 
1/reflectance (log 1/R) was recorded at 2nm intervals. Spectral data were trimmed to 
only include NIR portion of the spectrum, which ranges from 1100 to 2498 nm, to 
eliminate variations between samples due to seed discoloration that appear in the 
visible region. 
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Plant Materials: 

[0051] Nine cultivars were selected for this experiment. These cultivars 
represented a wide range of genetic diversity for SCN resistance. Differential 
cultivars for SCN race isolate classification as proposed by Golden et al. (1970) were 
selected because most SCN resistant commercial soybean cultivars developed their 
SCN resistance from these cultivars. The differential cultivars Peking 1 , Tickett' 
(Brim and Ross, 1966), PI 90763, and PI 88788 along with PI 437654 because of its 
broad resistance to all known races of SCN. The cultivars Essex 1 and Magellan 1 
(Schapaugh et al., 1998) were selected as SCN susceptible checks to determine SCN 
infection levels. The final two experimentals were two near isogenic lines (NIL) 
derived from the cultivar 'Bedford'. One of the lines was SCN resistant and 
designated NIL1 the other was SCN susceptible and designated NIL2. 

[0052] Calibration equations were developed for the purpose of screening 
soybean germplasm with unknown reactions to SCN. Calibration equations using 
NIRS rely on accurate reference data. The SCN bioassay provided reference data for 
the basis of developing calibration equations for predicting SCN resistance in 
soybean. Female index was calculated for each experimental line using the 
susceptible cultivar Essex equal to 100. Essex had an average of 94 females per plant. 
The SCN. race isolate was confirmed as race 3 by a FI<10 on the differentials, PI 
90763, PI 88788, 'Peking', and 'Pickett' (Brim and Ross, 1966). Female index of the 
experimental cultivars ranged from 0.0 to 163.8. High levels of infection were 
confirmed by SCN susceptible controls. The SCN susceptible check cultivars had FIs 
of 78.72, 105.32, 163.83 for the cultivars 'Macon' (Nickell et al. 1996), 'Hamilton' 
(Nickell, 1989), and 'Hutcheson' (Buss et al. 1988), respectively. Of particular 
interest were the cultivars Macon and Hamilton as they were used as parents in 
several of the experimental cultivars containing one or both parents as susceptible to 
SCN. Some of these experimental lines exhibited transgressive segregation, as their 
FIs were lower than the FIs of Macon and Hamilton. Yue et al. (2000) suggested the 
possibility of Hamilton containing minor SCN resistance genes that contributed to its 
lower R. The data would support the possibility of Macon containing some minor 
SCN resistance genes as well as Hamilton- Twenty-five lines from crosses with both 
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parents being SCN susceptible received a classification of moderately susceptible 
based on their FI. 

[0053] It is also important to note that many of the lines resulting from crosses 
between one parent that was SCN resistant and one that was SCN susceptible were 
still segregating for SCN resistance. This suggested that these lines existed as 
heterogeneous populations containing both SCN resistant and SCN susceptible 
individuals. This was evident by the wide range of FIs for individual replications in a 
given line. Female index used for classification of lines was based on the average of 
all five replications for each line. 

Experimental Design: 

[0054] Three randomized complete blocks were designed, one for each plant 
tissue harvesting date, 30, 60, and 90d following SCN inoculation. There were three 
replications in each block. Each replication consisted of two treatments, one 
treatment with SCN inoculation and the other treatment without SCN inoculation. 
This design provided 18 entries per replication and 54 entries per block for each of the 
three, plant tissue harvesting date for a total of 162 experimental units. 

[0055] Seeds for each entry were germinated in germination bags for five 
days. Seedlings for each entry were transplanted into steam sterilized fine sand. For 
the 30d harvest treatment, five seedlings were transplanted into each pot. Pots used 
for the 30d harvest treatment were 25.5cm in diameter. Four seedlings per pot were 
transplanted for each entry in the 60d harvest treatment while the 90d treatment only 
used 3 seedlings per plot. Pot size was increased for both the 60 and 90d harvest 
treatments to 35.5cm diameter to facilitate larger plant growth. 

[0056] Sixty and 90d harvest blocks were treated with Miracle-Gro (8-7-6) 
(The Scotts Company, Marysville, Ohio, USA) 45d after transplanting to compensate 
for lack of nitrogen-fixing bacteria in the sterilized soil. Plants were watered daily to 
maintain optimum plant health. All blocks were bordered with a single row of 
25.5cm diameter pots containing three plants of the cultivar Magellan to minimize 
border effects. Inoculation 
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[0057] Soybean cyst nematode inoculum was prepared according to Rao 
Arelli et al. (1991). Individual plants were inoculated with approximately 5,000 SCN 
eggs each, two days following seedling transplantation. The SCN race isolate used 
for inoculation was cultured on the SCN susceptible cultivar Hutcheson' (Buss et al., 
1988) and was considered race 3. Two applications of 2500 eggs each were placed on 
opposite sides of individual plants in holes 1cm in diameter and approximately 5cm 
deep using an automatic pipetter (Brewer Automatic Pipetting Machine, Scientific 
Products, Baltimore, Maryland, USA). 

Harvesting of Plants: 

[0058] Plants were-harvested on 30d cycles following inoculation. Plants 
were harvested by cutting the pots and soaking them in water to allow the soil to 
slowly dissolve from the roots. Roots were then washed with pressurized water to 
remove cysts from roots and cysts were collected and counted under a 
stereomicroscope. It is important to note that roots from entries not receiving 
inoculation were also washed to insure uniformity among root tissue samples. 
Following root washing, plant tissue from each entry was separated according to leaf 
stem, and root tissue. In the 90d harvest treatment, pod tissue was also harvested 
from each sample. Plant tissue was collected into cloth bags and frozen at 20°C and 
then freeze dried. Once samples were dry, they remained frozen at -20°C until they 
were ground. 

Example 2 

Preparation and Analysis of Tissue Samples 
NIR Sample Preparation and Analysis: 

[0059] Dried plant tissue samples were ground to pass through a 1-mm screen 
using a cyclone type grinder. Ground samples were then placed into clear, sealable 
plastic bags and returned to the freezer (-20°C) to await spectral analysis. 
Approximately 7g of ground plant tissue were used for NIR analysis. Tissue samples 
were analyzed using a Foss NIR Systems Model 6500 (Foss NIR Systems, Silver 
Spring, Maryland, USA). The instrument scanned each sample with radiation from 
400 to 2498nm in wavelength and log 1/reflectance (Log 1/R) was recorded at 2nm 
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intervals. Spectral data were trimmed to include just the near infrared portion of the 
spectrum, which ranges from 1100 to 2498nm, to remove variations due to different 
tissue color. 

Data Analysis: 

[0060] Cysts collected from entries that were inoculated with SCN were 
counted and female index (FI) (formerly called index of parasitism) was calculated 
according to Golden et al. (1970) and Schmitt and Shannon (1992). This data was 
analyzed to confirm established reactions of cultivars, to SCN and to confirm that the 
race isolate used for inoculation was race 3. 

Visualization of Spectra: 

[0061] Near infrared reflectance spectra were collected and analyzed using 
WinlSI II - Version 1.01 software (Foss NIRSystems, Silver Spring, Maryland, 
USA). Reflectance data was plotted every two wavelengths Across the entire 
spectrum. Math treatment of spectral data consisted of 2,4,4,1 for derivative order, 
gap, first smoothing, and second smoothing, respectively. The software then allowed 
a correlation curve to be plotted over the reflectance curves showing correlation 
between reference values and spectra at every wavelength. Individual peaks in the 
spectra showing strong correlation to the reference data could then be identified. A 
protocol for analyzing spectra by averaging the reflectance data of like samples to 
amplify differences between unlike samples was proposed by Dyer and Feng (1995). 
Using this protocol, replications within cultivars were averaged and analyzed for 
visual differences in the spectra. 

[0062] Reference data used for correlation was derived from bioassay results 
collected at the time of plant tissue harvest. Female index was calculated from actual 
cyst counts and used for reference data. Female index is calculated by dividing the 
number of cysts found on a given cultivar by the number of cysts found on the 
susceptible cultivar Essex and multiplying by 100. 

[0063] Spectral data was analyzed visually to determine if differences 
between SCN resistant and susceptible lines could be detected by MRS. Figure 1 
contains the entire population plotted together in red and the correlation of spectral 
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data to FI in black. Several peaks show fairly strong negative or positive correlation 
to M. Figure 2 contains a close-up view of the wavelengths surrounding the 
wavelength at 2127nm. The black line shows a fairly strong positive correlation 
between Fl and NIR absorption in this region of the spectrum. The broad red band 
shows the continuous variation of the population of 132 entries in this region of the 
spectrum. Studies by Dyer and Feng (1995) and Roberts et al. (1997) suggest 
developing a population by averaging spectral data of like samples together to 
maximize differences between groups while maintaining the variation of the original 
population. Figure 3 contains the entire spectrum for a population of 12 samples 
resulting from averaging original population in groups of 11 samples. The resulting 
population contained 7 SCN resistant entries and 5 susceptible entries. Once again 
the red line is the entire population plotted together while the black line shows 
correlation of spectral data to FI. Correlations at individual peaks were much stronger 
than those in the original population (Fig. 1). However, the peaks were consistent 
between the two populations. Eight peaks beyond 1900nm were identified as having 
strong correlation with SCN resistance. Peaks below 1900nm are likely associated 
with water or overtones of the peaks above 1900nm. Averaging spectral data together 
amplified differences between SCN resistant and susceptible entries. Figures 4 and 5 
contain close-ups of regions showing separation between SCN resistant and 
susceptible entries. Data in Fig. 4 shows distinct separation between SCN resistant 
and susceptible entries at 221onm wavelength. Data in Fig. 5 shows distinct 
separation between SCN resistant and susceptible entries at 2126nm and again at 
about 2135nm wavelength. 

Discriminate Analysis: 

[0064] Because all cultivars used in this experiment are classified as either 
completely resistant to SCN or completely susceptible, a discriminate analysis was 
performed to try to distinguish between SCN resistance and susceptibility using NIR 
data. Cultivars commonly known as SCN resistant were assigned a reference value of 
one and SCN susceptible cultivars were assigned a value of two. A discriminate 
analysis was also performed to distinguish between plants that had been inoculated 
with SCN and those that had not. Entries receiving SCN inoculation were assigned a 
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reference value of one and entries that did not receive SCN inoculation were assigned 
a value of two. Discriminate analysis used partial least squares regression to 
discriminate samples on either the basis of inoculation or SCN resistance according to 
the spectral data from each sample. Each sample was then categorized as correctly 
falling into its assigned group, incorrectly falling into its assigned group or uncertain 
as spectral data indicates that the sample could fall into either group. First and second 
derivatives of log 1/R were used along with the math treatment of 4,4,1 for gap, 
smooth, and second smooth respectively, however, scatter correction was not used in 
any discriminate analysis calibrations. 

Calibration Equation Development: 

[0065] Calibration equations for the purpose of predicting SCN resistance 
were developed. Known reference data collected from bioassay analysis as well as 
commonly used SCN resistance classifications used in discriminate analysis were 
regressed against spectral data. 

[0066] Regression equations were calculated by regressing spectral data 
against reference values calculated from external data sources such as the SCN 
bioassay. These equations were developed for purpose of predicting SCN resistance 
in samples with unknown genotypes. Four different reference values were assigned to 
each sample based on the SCN bioassay and the pedigree of each sample. The 
reference values were as follows: Pedigree value 1 =samples with both parents SM 
resistant, 2=samples with one parent SCN resistant and one parent SCN susceptible, 
3=samples with both parents SCN susceptible; SCN bioassay 1 =resistant; 
2=moderately resistant; 3=moderately susceptible, 4=susceptible. Female index and 
actual cyst counts were also used as reference values. 

[0067] Equations were developed using a modified partial least squares 
(MPLS) regression. The first and second derivatives of log 1/R each with and without 
scatter correction were used in developing calibration equations. Math treatments of 
the spectral data consisted of 4 for gap, 4 for smooth, and 1 for second smooth. This 
initial analysis resulted in four separate equations for each source of reference. For 
each equation, 4 cross validation groups were used and 2 oudier passes were 



Docket: 399483 



16 



performed. Outliers were selected on the basis of critical T = 2.5 and critical H'= 
10.0. Calibration equations were evaluated on the basis of coefficient for 
determination of calibration (R 2 ), standard error of calibration (SEC), standard error 
of cross validation (SECV), and validation accuracy (1-VR). 

Bioassay: 

[0068] Female index levels were calculated from entries that were inoculated 
with SCN at each plant-tissue harvest date. Data in Table 1 shows the average 
number of cysts per plant and FI for each of the three plant-tissue harvest dates. 
While data from the 60d and 90d harvest dates were consistent with the predictions, 
the 30d harvest date produced poor results. Essex, the susceptible control, only 
produced an average of 4.53 cysts per plant. Essex produced an average of 587.08 
and 4055.67 cysts per plant for the 60d and 90d harvest dates, respectively. When 
calculating FI, average number of cysts on Essex was set equal to 100. Average 
number of cysts were then divided by the average number of cysts on Essex and 
multiplied by 100. Female index levels on 60d and 90d harvest on SCN differential 
cultivars PI 88798, PI 90763, Peking, and Pickett are all less than 10 indicating that 
the SCN phenotype used was indeed race 3. Female index numbers on 30d harvest 
are inaccurate due to low average number of cysts on Essex. It is also important to 
note the substantial difference in FI for the two near isogenic lines, NIL 1 and NIL 2. 
These data confirm that the two lines have different SCN reactions. 

Storing of Spectra: 

[0069] Thirty-day tissue harvest failed to produce enough plant tissue to 
adequately be analyzed by NIRS. Irregular results from the SCN bioassay were 
withdrawn from analysis. However, 60d and 90d harvests produced ample plant 
tissue from each root, stem, leaf, and pod for NIRS analysis. 

Visualization of Spectra: 

[0070] Visual analysis allowed the relationship between SCN resistance and 
spectral data to be identified. The relationship between inoculation treatment and 
spectral data was also identified. Correlations between SCN resistance and spectral 
data and correlations between inoculation treatment and spectral data were analyzed 
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for each plant tissue sampled. Results for stem and pod tissue mirrored those for leaf 
tissue but differences did exist between results gathered for root tissue and the above 
ground tissues. Only leaf and root tissue will be discussed. 

[0071] Data in Figure 1 shows the relationship between reflectance and 
inoculation treatment. Reflectance data is shown by the thick red curve, which is 
actually data from all the samples plotted together. Correlation between inoculation 
treatment and reflectance data is shown by the black curve. The y-axis shows the 
amount of correlation ranging from —1 .0 to 1 .0. The correlation curve is fairly 
constant deviating little from zero, which is no correlation. This suggests little effect 
of inoculation on reflectance data. Several peaks with correlation of 0.5 or -0.5 can 
be identified suggesting a relationship can be drawn between SCN resistance and 
spectral data of soybean leaf tissue (Figure 2). Figures 3 and 4 contain reflectance 
data from root tissue harvested from the same plants. Data in Figure 3 shows 
correlation between inoculation treatment and reflectance data while data in Fig. 4 
shows correlation between reflectance and SCN resistance. Again, little correlation 
exists between inoculation treatment and reflectance data (Figure 3). Surprisingly, 
not as many peaks in root tissue strongly correlated with SCN resistance as in leaf 
tissue (Figure 4). 

[0072] Spectra were averaged across replications for each tissue type and 
plotted against correlation data using FI in an attempt to amplify differences between 
SCN resistance and susceptibility using actual measurements of resistance. Figures 5 
and 6 contain correlation of FI with root (Fig. 5) and leaf (Fig. 6) tissue harvested 60 
days after SCN inoculation. Both figures show a number of peaks with strong 
positive and negative correlation to FL. It is important to notice the difference in 
correlation between data in Figure 4 and Figure 5. Data in Figure 5 shows much 
stronger correlations between spectra and FI than Figure 4 showing a simple 
correlation between SCN resistance classification and spectra. 

Discriminate Analysis: 

[0073] Results from discriminate analysis mirrored results from MPLS 
analysis. For discriminate analysis, population consisting of averaged spectra from 
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leaf and stem tissue for both 60 and 90-day harvests was used. Using spectral data to 
discriminate between SCN resistant and susceptible cultivars resulted in no 
misclassifications and 27 samples uncertain as to which classification they belonged 
in. The data in Table 2 shows R 2 and 1-VR values were relatively high at 0.88 and 
0.71, respectively. Discriminate analysis performed with the same population on the 
basis of whether or not plants were inoculated failed to produce a significant 
correlation between spectral data and presence of the SCN pathogen. 

Calibration Equation Development: 

[0074] Initially, plant tissue spectral data was separated by tissue type and 
harvest date. Modified partial least squares regressions performed on these 
populations showed that no correlation existed between spectral data and whether or 
not plants were inoculated with SCN. Spectral data from root tissue also failed to 
produce a correlation between spectral data and SCN reaction- Spectral data from 60d 
and 90d leaf and stem tissue and 90d pod tissue produce weak correlations between 
spectral data and SCN reaction (Table 3). It is believed that weak correlations were 
possibly due to low population size. Given the consistency in SCN reaction as 
evident by SCN bioassay data, leaf and stem data were averaged together and 60d and 
90d populations were merged into one. This provided a population size of 82 
individual plants. As a result, stronger correlations existed as evident by higher R 
and 1-VR values (Table 4). 

[0075] Final attempts to increase population size were performed by analyzing 
individual tissues from leaf, stem, and pod for 60d and 90d together. Final 
population size was 205 samples. While the possibility of calibration equations being 
over fit exists with the inclusion of more than one individual sample from the same 
plants, MTLS regressions once again showed a strong correlation between SCN 
resistance and spectral data, and R 2 1-VR values were high (Table 4). 

[0076] Data is provided hereinbelow in Tables 1-10, and spectra in Figures 1- 

12. 

[0077] The results concluded that NIRS could distinguish soybean lines on the 
basis of SCN resistance using ground soybean seed and ground soybean plant tissue. 
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[0078] Regression analysis of NIR data on the basis of inoculation treatment 
show that presence of SCN is not required to distinguish between SCN resistant and 
susceptible genotypes. Visual correlation analysis of spectra also confirm the lack of 
interaction between inoculation and spectra. This allows breeders to screen 
populations of experimental lines without having to inoculate them with SCN. 

[ 007 9 ] Near infrared spectroscopy provides an alternative to bioassay 
methods as well as DNA-based markers for selection of SCN resistant genotypes. 
The data strongly suggests that differences in plant tissue due to SCN resistance exist, 
and those differences may be found in the NIR spectra. Discrimination between SCN 
resistant and susceptible soybean lines using infrared spectroscopy was successful. 
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Table 9 



SCN Treatment 




SCN Inoculated 


Non-Inoculated 




Tissue Harvest 


30 days 


60 days 


90 days 


30 days 


60 days 


90 days 


NIL1 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


NIL 2 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


Essex 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


P.I. 88788 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


Peking 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


P J. 90763 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


Pickett 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


Pi 437654 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


Magellan 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


3 reps 


Totals 


27 


27 


27 


27 


27 


27 



162 Total Plots* 



* 162 plots harvested for root, stem, and leaf tissue for 30 and 60 day treatments with pod tissue added to 90 day treatments 
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Table 10. 



Entry 



Plant 



Plant 



Plant 
3 



Plant 
4 



Plant 
5 



Plant 
6 



Average 
Number of 
Cysts per 



SCN 



FI 



SCN 
Pedigree 



1 


4 


7 


4 


1 




a nn 
4.0U 


A OA 
4.ZO 


D AC V O PC 

neb a ixCb 


2 


0 


0 


2 


5 


0 


1 Af\ 

1.40 


1 AQ 


L Res x Res 


3 


1 


5 


1 


7 


5 


"* OA 
3.oO 


A ftA 
4.U4 


1 Doc v Dpc 
. KCS X IxCa 


4 


3 


2 


5 


O 


1 


'X Aft 
3.4U 


AO 
3.0Z 


D ac v Dpc 
KCS X KCS 


5 


0 


2 


1 


0 


0 


ft Aft 

o.oo 


ft AA 
U.04 


I Res x Res 


6 


2 


3 


2 


0 


2 


1 Qft 

l.oU 


1 Ol 
1.5*1 


L KCS X KCS 


7 


2 


0 


3 


2 




1 *7< 


1 QA 
l.oO 


1 Dpc v Dag 


8 


1 


0 


4 






1 AT 
1.0/ 


1 HH 
1. / / 


f Dpc v Dac 


9 


0 


7 


5 


5 


3 


A ftft 

4.UU 


>* OA 
4.Z0 


L Res x Res 


10 


1 


0 


2 


1 


1 


1 AA 

1.00 


1 a/; 
1.06 


1 Dac v Dac 

l Kes x Kes 


11 


7 


13 


0 


9 


12 


o.ZO 


O TO 

o. /Z 


L Res x Res 


12 


1 


0 


6 


2 


2 


o on 
2.20 


O 1A 

Z.34 


1 D AO -V D AC 

L KcS X KcS 


13 


0 


0 


1 


0 


0 


a oa 
O.ZO 


A 01 

0.Z1 


L Res x Res 


14 


1 


1 


2 


6 


0 


2.00 


O 1 1 

2.13 


L Kes x Kes 


15 


0 


0 


1 






ft Ol 

0.33 


A OC 


L Res x Res 


16 


2 


0 


0 


2 


1 


1.00 


1.06 


1 O j w - v D AC. 

l Kes x Kes 


17 


1 


0 


2 


2 




1 oc 
I.Zj 


1.33 


1 Res x Res 


18 


2 


0 


8 


2 


0 


2.40 


o cc 


1 Dac v Dac 

i Kes x Kes 


19 


2 


4 


1 


1 


0 


1.60 


1.70 


L Res x Res 


20 


2 


0 


2 


1 




1.25 


1 oo 
1.33 


L Res x Res 


21 


1 


2 


0 


0 




A OC 

0.75 


A OA 

O.oO 


L Res x Res 


22 


2 


2 


1 


. 4 




o o< 

Z.Zj 


O OA 


L Res x Res 


23 


2 


7 


6 






5.00 


c oo 
5.32 


1 Res x Res 


24 


0 


5 


1 






2.00 


O 1 1 

Z.13 


1 Res x Res 


25 


3 


3 


2 






o «co 
2.67 


O Qvt 

Z.o4 


1 Res x Res 


26 


1 


2 


1 


1 


3 


1 iCA 
1.00 


1 OA 
l./O 


1 Res x Res 


27 


3 


0 


0 


3 




1.50 


1 iCA 

1.60 


1 Res x Res 


28 


5 


2 


1 


2 




2.50 


o « 
Z.60 


1 Res x Res 


29 


3 


0 


2 


2 


3 


2.00 


2.13 


1 Res x Res 


30 


1 


0 


0 


2 


1 


A DA 

O.oO 


A Q< 

O.oO 


1 Res x Res 


31 


3 


11 


0 


4 




4.50 


A OA 


1 Res x Res 


32 


0 


3 


0 






1 AA 

1.00 


1 A#£ 

l.Oo 


1 Res x Res 


33 


0 


0 


1 


0 




A OC 

0.25 


A OO 

0.2 / 


1 Dap tt Dac? 

l Kes x Kes 


34 


2 


10 


0 


3 


2 


1 Af\ 

3.40 


O AO 

3.0Z 


1 Res x Res 


35 


1 


2 


2 


4 


0 


1 OA 

1.80 


1 01 

i.yi 


1 Res x Res 


36 


1 


1 


2 


1 




1 oc 
1.23 


1 oo 
1.33 


1 Res x Res 


37 


0 


2 


2 






1.33 


1 A O 

1.4Z 


1 Res x Res 


38 


7 


2 


1 


1 




2.75 


O AO 

2.V3 


1 Res x Res 


39 


0 


5 


9 


6 


9 


C OA 


A 1 O 
0.1 / 


1 D PC V D AC 

l Kes x Kes 


40 


13 


4 


5 


1 


3 


C OA 

5.20 


C CO 

5.53 


1 Res x Res 


41 


1 


0 


1 


1 


2 


1 AA 
1.00 


1 ftA 

1.00 


1 Res x Res 


42 


3 


2 


4 


0 


0 


1 OA 

l.oO 


1 Q1 


1 Res x Res 


43 


0 


2 


3 


0 




1.25 


1.33 


1 Res x Res 


44 


7 


3 


•2 


1 




3.25 


3.46 


1 Res x Res 


45 


1 


5 


2 


2 


5 


3.00 


3.19 


1 Res x Res 


46 


8 


0 


10 






6.00 


6.38 


1 Res x Res 


47 


1 


3 


5 


3 




3.00 


3.19 


1 Res x Res 


48 


4 


4 


1 


6 




3.75 


3.99 


1 Res x Res 


49 


4 


2 


25 


0 


2 


6.60 


7.02 


1 Res x Res 


50 


1 


4 


4 


0 


3 


2.40 


2.55 


1 Res x Res 


51 


1 


1 


5 






2.33 


2.48 


1 Res x Res 


52 


8 


6 


2 


12 


3 


6.20 


6.60 


1 Res x Res 


53 


19 


6 


10 


4 


7 


9.20 


9.79 


1 Res x Res 


54 


7 


5 


11 


4 


1 


5.60 


5.96 


1 Res x Res 



Q 
^3 

m 
m 

EG 



o 

!?=!! 



Docket: 399483 



32 



o 



■0 



01 

103 















Plant 


Average 
Number of 




SCN 


SCN 


Entry 
# 


Plant 
i 


Plant 

2 


Plant 

3 


Plant 
4 


Plant 
5 


6 


Cysts per 
Plant 


FI 


Resistance 
Rating 


Pedigree 


cc 

jj 


16 


10 


3 


10 


2 




8.20 


8.72 


1 


Res x Res 


^A 

JO 


3 


6 


2 


6 


0 




3.40 


3.62 


1 


Res x Res 




2 




o 


5 


6 




2.80 


2.98 


1 


Res x Res 


JO 


2 


1 


2 








1.33 


1.42 


1 


Res x Res 


SQ 


o 


o 


1 


18 


5 




5.60 


5.96 


1 


Res x Res 


An 
ou 


1 1 


4 


1 


6 


2 


8 


5.17 


5.50 


1 


Res x Res 


61 


3 


3 


1 ^ 

Ij 


7 


7 




8.20 


8.72 


! 


Res x Res 


62 


i 
3 


Q 


o 








5.00 


5.32 


1 


Res x Res 


63 


1U 


0 


•J 


9 


11 




7.00 


7.45 


! 


Res x Res 


64 
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[0080] All publications and patent applications mentioned in the specification 
are indicative of the level of those skilled in the art to which this invention pertains. 
All publications and patent applications are herein incorporated by reference to the 
same extent as if each individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 
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